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Abstract 

The history of computerisation of the Western Australian Museum's modern fauna collections is 
outlined (it was initiated in the late 1970's on the Treasury mainframe computer). The Museum is 
custodian of approximately 1.5 million specimens and specimen lots of modem fauna. The majority 
of specimens in its collections are from Western Australia, but the collections also include 
specimens from Indonesia and New Guinea. Currently only the vertebrates are comprehensively 
databased. The Museum's current primary collections database system is briefly described as well 
as its recently released on-line version - FaunaBase. 
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Historical background 

The first faunal specimens with associated data were 
accessioned into the Western Australian Museum 
collection in the 1890's. As was traditional with natural 
history museums, specimens were (and still are) 
arranged in collections corresponding to phylogenetic 
groupings (mammals, birds, arachnids, etc). The practice 
of assigning each specimen within a collection a unique 
registration number, recording associated data on a label, 
and entering duplicate and additional data into a hand 
written catalogue was started in the early 1900's. 
However, different curators adopted somewhat different 
methods of assigning registration numbers. In some 
collections the specimens were numbered sequentially, 
whereas in others they were numbered sequentially by 
year. Also, data accompanying many of the earliest 
records are often imprecise, particularly the collection 
locality (e.£. "Western Australia"). 

The first attempt to produce an electronic listing (but 
not a database in the modem sense) was in the late 1970's 
when a catalogue of the modem mammal collection was 
produced using the Treasury mainframe computer and 
involving punch cards and special programming. In 
1983, as a foundation member of the Western Australia 
Land Information System, the Museum received funding 
for an in-house UNIX server through the Government 
Computing Committee and the Museum employed a 
contract programmer (with a background in biology) to 
develop customised database software for the mammal, 
herptile, fish, arachnid and crustacean collections. This 
in-house system ran very successfully until the late 1980's 
when the Museum's inability to continue to support it 
prompted a switch to a commercial multi-user database 
system (INGRES), developed and supported externally. 
The Museum's collections databases were temporarily 
transferred to FOXPRO in about 1995, pending 
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availability of BIOLINK, a multi-user database designed 
for biological collections management and research 
under development by CSIRO. 

BIOLINK was not designed with registration number 
as a key field. For this and other reasons the Museum 
decided in 2000 to adapt the collections management 
system developed in Microsoft Access® by the 
Department of Agriculture for its entomology collection. 
This was done by Dr Graeme Christie and is the system 
currently in use and described below. 

The Museum's primary collections databases 

The approach taken was to develop a generic database 
system that accommodated the collections management 
requirements of all the Museum's fauna collections. This 
necessitated considerable effort to standardise fields and 
existing differences in data standards before all 
collections could be successfully migrated to the new 
system. Post migration required extensive validation of 
data. 

Currently there are eleven faunal collections databases 
operating on this generic Microsoft Access® system. The 

Table 1 


List of modern fauna collections in the Western Australian 
Museum, numbers of records databased and proportion of each 
collection databased. 


Collections 

Numbers of 
databased 
records 

Approximate 
percentages of 
collections databased 

Mammals 

49,734 

100 

Sub-fossil Mammals 

10,000 

15 

Birds 

33,822 

100 

Reptiles/Amphibians 

131,847 

100 

Fishes 

47,748 

100 

Arachnids 

46,666 

<10 

Insects 

34,183 

<10 

Molluscs 

22,400 

<10 

Crustaceans 

31,056 

30 

Worms 

4,277 

<10 

Other Marine Invertebrates 12,062 

<10 
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size of each, and degree to which data capture of each 
collection is complete varies and is summarised in Table 
1. The vertebrate collections are the most 
comprehensively databased, the invertebrate groups are 
the least (because of the larger numbers of taxa and 
poorer knowledge of the taxonomies). 

The Museum's primary collections databases were 
designed with ease-of-use being the primary aim without 
losing the potential to expand or accommodate new 
technology as it became available. Being based on 
Microsoft Access®, the database technology follows what 
has become an industry'' standard and as such is well 
understood and supported; an important consideration 
for ongoing development and maintenance. This 
technology also allows for immediate integration to a 
wide spectrum of other existing or newly developed 
computer technologies, in particular the Internet. 

The ability to rapidly and accurately enter large 
quantities of information into the primary databases has 
been addressed in the database design. Predefined forms 
are used to enter the data for one record at a time: each 
record is uniquely identified by its Registration Number, 
which doubles as the collection Lot number. Wherever 
feasible, for example, in the selection of taxonomic 
names, country, state, etc, look-up lists are provided for 
ease of data entry and integrity. Data validation is 
enforced at field level to avoid the entry of invalid or 
inaccurate data. Automated processes, such as the ability 
to automatically derive higher taxa from genus, or the 
selection of default values, have been implemented. An 
interface to an external gazetteer database provides the 
ability to rapidly obtain latitudes and longitudes from 
selected place names and/or offsets from these places, or 
vice versa. Automatic conversion from latitude to decimal 
is also implemented. 

An interface to a Tissues' database is provided in 
order to cater for the increasing need to use molecular 
technology to support taxonomy. Similarly, an interface 
to an Images database links specimen records to digitised 
images. 

A suite of utilities has been developed for rapid 
database maintenance. In order to ensure that collections 
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Figure 1. Example of primary collection database entry page 
(for mammals) showing the main menu options. 


database records align with taxonomic changes a 
program has been developed using Visual Basic. This 
program allows global changes to the database records 
whenever changes to taxonomic names are required or if 
taxa are incorrectly positioned within the taxonomic 
hierarchy. A taxonomic tree is generated based on the 
taxa within the database and any taxon can be moved to 
its correct location in the hierarchy using a "drag-and- 
drop" method. All SQL statements required to 
implement these global changes are automatically 
generated by the program. This utility can also generate 
a distribution map for any taxon. Each specimen is 
represented by a single dot on the map indicating its 
collection location. Underlying records for that taxon can 
then be interrogated and edited, if required, by selecting 
the dot or dots. A related program provides a read-only 
taxonomic tree/mapping interface. Another utility allows 
up to 20 related or unrelated taxa to be plotted 
superimposed on the same map for comparison. This 
program is useful for rapidly finding and editing records 
with incorrect geographical coordinates. 

Additional utilities have been developed to support 
the ongoing curation of collection specimens, such as a 
label generator, loans management, report production, 
and search facilities. The database design also handles 
interfaces to third party products, such as Arc View. 

The primary collections databases have been designed 
to accommodate observation records by standardising on 
table structure. This approach allows existing interfaces 
and utilities to work seamlessly over collections records 
and/or observations records (see Fig 1 for example of an 
entry page to one of the primary databases). 

The Museum's Secondary On-line Databases- 

FaunaList and FautiaBase 

(www. museum, wa .gov.au /faunabase) 

FaunaList is an on-line phylogenetically arranged 
checklist of the scientific names for each vertebrate group 
(amphibians, birds, mammals, reptiles and fishes) 
recorded from Western Australia and the surrounding 
seas. Based on a recently published checklist (Aplin et ah 
2001) it contains literature references, flags gazetted 
"threatened" fauna, and provides information on 
numbers of taxa and lists of type specimens in the 
Museum's collections. It is easily updated and it is the 
Museum's intention to keep it current. 

FaunaBase, which is a derivative of the primary fauna 
collections databases, has been developed as an on-line 
product tailored to meet the needs of the majority of users. 
Currently it provides access to records of more than 
170,000 terrestrial vertebrate specimen records and 32,965 
bird observational records of Western Australian taxa [i.e. 
it excludes records from overseas (the Museum holds 
extensive collections from south east Asia and New 
Guinea) and from the rest of Australia if the taxa do not 
occur in this State]. Recently FautiaBase has added 
terrestrial vertebrate records of the Queensland Museum, 
and Museum and Art Gallery of the Northern Territory. 
However, FautiaBase has the potential to deliver all data 
contained in all the primary databases 

Users of FautiaBase may access specimen records via a 
number of entry modes: common names, thumbnail 
images, a taxonomic tree. Family or a search on scientific 
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name. It is also possible to define and search a specified 
area to produce a listing of taxa recorded within it. All 
methods produce for each taxon a map showing the 
distribution of all Western Australian Museum specimen 
records in Australia, an image (if available) and a written 
description of its total distribution. Restricted access is 
possible to specimen records underlying mapped 
distribution points. 

FannaBase provides 'snapshots' of integrated 
information taken regularly from the Museum's primary 
collections databases. Meanwhile, the digital distribution 
maps for each species are not predefined but dynamically 
generated by the system's GIS. One of its strengths is its 
simplicity. The system draws on the nomenclature 
contained in the FaunaList database, a flat structure akin 
to spreadsheets. During searches of FaunaBase, the GIS 
superimposes geographic information on a base map of 
Australia using ASP and Java technology. The Museum's 
server uses ASP technology to extract information from 
the databases for three corresponding values: latitudinal 
coordinates, longitudinal coordinates and date category. 
These values are passed to a Java applet, which generates 
maps in the client's browser. Geographical coordinates 
are assigned simplified corresponding x/y decimal 
values, which are then identified and extracted by the 
applet. The date categories (50 year intervals) simply 
provide a breakdown of when the specimen was 
collected. To conduct an area search, users can either type 
in specific coordinates or click the mouse twice to define 
the top left and bottom right corners of the area to be 
searched; JavaScript is used to enable this function. It is 
recommended that FaunaBase be viewed using Internet 
Explorer version 4 or above with Cookies, JavaScript and 
Java enabled. 


Discussion 

Containing more than 1.5 million specimen records, 
the Museum's primary fauna collections databases are by 
far the largest taxonomically verifiable databases of the 
Western Australian fauna (they are taxonomically 
verifiable because each record is underpinned by a 
voucher specimen). Other strengths include their long 
temporal coverage, extensive (rather than intensive) 
geographical coverage of the State and the high degree 
of reliability of identification of taxa. Weaknesses include 
the mainly low spatial resolution (particularly of older 
records), they consist of point data reflecting presence 
only (not absence) and they are inadequately resourced, 
with the result that backlogs of data capture remain 
substantial for invertebrates. Very large observational 
databases, particularly for birds, also remain to be 
captured electronically. 

Development of FaunaBase has been a significant step 
towards on-line delivery of specimen and observational 
data contained in the Museum's primary databases. The 
quantity of fauna data potentially available should 
significantly increase in the near future through access to 
Western Australian fauna records currently held in the 
collections of other state museums and CSIRO. OZCAM 
(On-line Zoological Collections of Australian Museums) 
is a joint Commonwealth and State funded project that 
will enable distribution of fauna data held by these 
institutions. 
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